139 research outputs found

    Fourmidable: a database for ant genomics

    Get PDF
    BACKGROUND: Fourmidable is an infrastructure to curate and share the emerging genetic, molecular, and functional genomic data and protocols for ants. DESCRIPTION: The Fourmidable assembly pipeline groups nucleotide sequences into clusters before independently assembling each cluster. Subsequently, assembled sequences are annotated via Interproscan and BLAST against general and insect-specific databases. Gene-specific information can be retrieved using gene identifiers, searching for similar sequences or browsing through inferred Gene Ontology annotations. The database will readily scale as ultra-high throughput sequence data and sequences from additional species become available. CONCLUSION: Fourmidable currently houses EST data from two ant species and microarray gene expression data for one of these. Fourmidable is publicly available at http://fourmidable.unil.ch

    DeltaRpkm: an R package for a rapid detection of differential gene presence between related bacterial genomes

    Get PDF
    Comparative genomics has seen the development of many software performing the clustering, polymorphism and gene content analysis of genomes at different phylogenetic levels (isolates, species). These tools rely on de novo assembly and/or multiple alignments that can be computationally intensive for large datasets. With a large number of similar genomes in particular, e.g., in surveillance and outbreak detection, assembling each genome can become a redundant and expensive step in the identification of genes potentially involved in a given clinical feature.Results: We have developed deltaRpkm, an R package that performs a rapid differential gene presence evaluation between two large groups of closely related genomes. Starting from a standard gene count table, deltaRpkm computes the RPKM per gene per sample, then the inter-group δRPKM values, the corresponding median δRPKM (m) for each gene and the global standard deviation value of m (sm). Genes with m >  = 2  ∗ sm (standard deviation s of all the m values) are considered as “differentially present” in the reference genome group. Our simple yet effective method of differential RPKM has been successfully applied in a recent study published by our group (N = 225 genomes of Listeria monocytogenes) (Aguilar-Bultet et al. Front Cell Infect Microbiol 8:20, 2018).Conclusions: To our knowledge, deltaRpkm is the first tool to propose a straightforward inter-group differential gene presence analysis with large datasets of related genomes, including non-coding genes, and to output directly a list of genes potentially involved in a phenotype

    The PROSITE database, its status in 1999

    Get PDF
    The PROSITE database (http://www.expasy.ch/sprot/prosite.html) consists of biologically significant patterns and profiles formulated in such a way that with appropriate computational tools it can help to determine to which known family of protein (if any) a new sequence belongs, or which known domain(s) it contain

    Visualization and quality assessment of de novo genome assemblies

    Get PDF
    Summary: Recent technological progress has greatly facilitated de novo genome sequencing. However, de novo assemblies consist in many pieces of contiguous sequence (contigs) arranged in thousands of scaffolds instead of small numbers of chromosomes. Confirming and improving the quality of such assemblies is critical for subsequent analysis. We present a method to evaluate genome scaffolding by aligning independently obtained transcriptome sequences to the genome and visually summarizing the alignments using the Cytoscape software. Applying this method to the genome of the red fire ant Solenopsis invicta allowed us to identify inconsistencies in 7%, confirm contig order in 20% and extend 16% of scaffolds. Contact: [email protected]; [email protected] Availability: Scripts that generate tables for visualization in Cytoscape from FASTA sequence and scaffolding information files are publicly available at https://github.com/ksanao/TGNet. Supplementary Information: Supplementary data are available at Bioinformatics onlin

    Swiss EMBnet node web server

    Get PDF
    EMBnet is a consortium of collaborating bioinformatics groups located mainly within Europe (http://www.embnet.org). Each member country is represented by a ‘node', a group responsible for the maintenance of local services for their users (e.g. education, training, software, database distribution, technical support, helpdesk). Among these services a web portal with links and access to locally developed and maintained software is essential and different for each node. Our web portal targets biomedical scientists in Switzerland and elsewhere, offering them access to a collection of important sequence analysis tools mirrored from other sites or developed locally. We describe here the Swiss EMBnet node web site (http://www.ch.embnet.org), which presents a number of original services not available anywhere els

    A novel bioinformatics pipeline to discover genes related to arbuscular mycorrhizal symbiosis based on their evolutionary conservation pattern among higher plants

    Get PDF
    Genes involved in arbuscular mycorrhizal (AM) symbiosis have been identified primarily by mutant screens, followed by identification of the mutated genes (forward genetics). In addition, a number of AM-related genes has been identified by their AM-related expression patterns, and their function has subsequently been elucidated by knock-down or knock-out approaches (reverse genetics). However, genes that are members of functionally redundant gene families, or genes that have a vital function and therefore result in lethal mutant phenotypes, are difficult to identify. If such genes are constitutively expressed and therefore escape differential expression analyses, they remain elusive. The goal of this study was to systematically search for AM-related genes with a bioinformatics strategy that is insensitive to these problems. The central element of our approach is based on the fact that many AM-related genes are conserved only among AM-competent species.Results: Our approach involves genome-wide comparisons at the proteome level of AM-competent host species with non-mycorrhizal species. Using a clustering method we first established orthologous/paralogous relationships and subsequently identified protein clusters that contain members only of the AM-competent species. Proteins of these clusters were then analyzed in an extended set of 16 plant species and ranked based on their relatedness among AM-competent monocot and dicot species, relative to non-mycorrhizal species. In addition, we combined the information on the protein-coding sequence with gene expression data and with promoter analysis. As a result we present a list of yet uncharacterized proteins that show a strongly AM-related pattern of sequence conservation, indicating that the respective genes may have been under selection for a function in AM. Among the top candidates are three genes that encode a small family of similar receptor-like kinases that are related to the S-locus receptor kinases involved in sporophytic self-incompatibility.Conclusions: We present a new systematic strategy of gene discovery based on conservation of the protein-coding sequence that complements classical forward and reverse genetics. This strategy can be applied to diverse other biological phenomena if species with established genome sequences fall into distinguished groups that differ in a defined functional trait of interest

    Complete genome sequences of virulent mycoplasma capricolum subsp. capripneumoniae strains F38 and ILRI181

    Get PDF
    Contagious caprine pleuropneumonia (CCPP) caused by Mycoplasma capricolum subsp. capripneumoniae is a severe epidemic affecting mainly domestic Caprinae species but also affects wild Caprinae species. M. capricolum subsp. capripneumoniae belongs to the “Mycoplasma mycoides cluster.” The disease features prominently in East Africa, in particular Kenya, Tanzania, and Ethiopia. CCPP also endangers wildlife and thus affects not only basic nutritional resources of large populations but also expensively built-up game resorts in affected countries. Here, we report the complete sequences of two M. capricolum subsp. capripneumoniae strains: the type strain F38 and strain ILRI181 isolated druing a recent outbreak in Kenya. Both genomes have a G+C content of 24% with sizes of 1,016,760 bp and 1,017,183 bp for strains F38 and ILRI181, respectively

    PII: S0968-0004(01)01835-7

    Get PDF
    Research Update We anticipate that further experimental investigation of the specific role of each of the five classes of Brix domain proteins in eukaryote model organisms, especially in yeast, will yield valuable new insights about ribosome synthesis and be key to our understanding of the ribosome biogenesis pathway. Acknowledgements We are grateful to E. Bogengruber, M. Breitenbach, F.M. Jantsch and G. Lepperdinger for supplying experimental data on sequence and function of Brix (AF319877) and yol077c before publication and for extensive discussion of the Brix sequence analysis results. This research was supported by Boehringer-Ingelheim International. Ubiquitin is a small protein, highly conserved among eukaryotes, that becomes covalently attached to both itself and a variety of cellular proteins 1,2 . The role of this ubiquitination is mostly to target proteins to the 26S proteasome degradation pathway 3 . In some cases, monoubiquitination (e.g. of histones) does not lead to degradation, but instead regulates other cellular processes such as chromatin remodeling 4 . Recently, several reports have described a role for monoubiquitination in a different pathway of protein degradation -the endocytosis and subsequent proteolysis of receptors and other transmembrane proteins by the vacuole or the lysosome 5,6 . According to the current model, the decisions about which protein is to be degraded at a specific time is made by the ubiquitination machinery, often in response to a prior event such as phosphorylation. Consequently, both the proteasome and the endocytosis machinery need a mechanism by which to faithfully recognize ubiquitinated proteins. The 26S proteasome comprises two main particles: the 20S core proteasome and the 19S regulatory complex. Subunit S5a (also known as Rpn10) of the 19S regulator binds polyubiquitin chains and has a preference for chains containing four or more ubiquitin monomers. The ubiquitin-interacting region has been mapped to two short, related motifs that are found in all members of the S5a family 7 . Using these regions, which comprise ~20 residues, as a starting point, we searched for other potential ubiquitin-binding sequences. Specifically, we used a combination of iterative database searches with generalized profiles, and Hidden Markov Models (profile-HMMs) 8 . Only sequences that matched a profile or an HMM derived from previously established family members, with error probabilities of p < 0.01, were used for subsequent iteration cycles. After eight cycles, the sequence motif 4 Migeon, J.C. et al. (1999) converged to a set of proteins shown in An observation of particular interest is the occurrence of UIMs in four classes of proteins involved in receptor endocytosisthe Eps15 subfamily of EH-domain proteins, the epsin subfamily of ENTH-domain proteins and two families of VHS-domain proteins, including the FYVE-finger proteins HRS and Vps27, and the SH3-domain proteins STAM and HBP. Eps15 is phosphorylated on Tyr850 by the ligandactivated epidermal growth factor (EGF) receptor and this phosphorylation is required for subsequent receptor endocytosis 10 . Furthermore, Eps15 binds to epsin, and both of these proteins interact with components of the endocytosis machinery, including clathrin and the AP-2 complex 11,12 . The fact that the phosphorylation site of Eps15 is immediately adjacent to a tandem UIM suggests that this motif might be involved in the regulated endocytosis of the EGF receptor. Liquid facets (lqf), an epsin from Drosophila melanogaster, was identified in a genetic screen as a dominant enhancer of the fat facets (faf) mutant eye phenotype 13 . This relationship links the gene encoding lqf with the ubiquitin system because faf is a deubiquitinating enzyme. An additional link between Eps15 and ubiquitin recognition is provided by the yeast protein Ede1p, the closest homolog of mammalian Eps15 (Ref. 14): in the yeast protein, the UIM is replaced by a UBA domain, a homology domain known to bind ubiquitin 15 . Proteins with an N-terminal VHS domain can be divided into three subtypes on the basis of their domain organization: (1) Vps27-HRS-like (type A); (2) STAM-HBP-like (type B); and (3) other proteins (type C) (se

    A petunia GRAS transcription factor controls symbiotic gene expression and fungal morphogenesis in arbuscular mycorrhiza

    Get PDF
    Arbuscular mycorrhiza (AM) is a mutual symbiosis that involves a complex symbiotic interface over which nutrients are exchanged between the plant host and the AM fungus. Dozens of genes in the host are required for the establishment and functioning of the interaction, among them nutrient transporters that mediate the uptake of mineral nutrients delivered by the fungal arbuscules. We have isolated in a genetic mutant screen a petunia GRAS-type transcription factor, ATYPICAL ARBUSCULE (ATA), that acts as the central regulator of AM-related genes and is required for the morphogenesis of arbuscules. Forced mycorrhizal inoculations from neighbouring wild type plants revealed an additional role of ATA in restricting mycorrhizal colonization of the root meristem. The lack of ATA, which represents the orthologue of RAM1 in Medicago truncatula, renders the interaction completely ineffective, hence demonstrating the central role of AM-related genes for arbuscule development and function

    PeroxiBase: a database with new tools for peroxidase family classification

    Get PDF
    Peroxidases (EC 1.11.1.x), which are encoded by small or large multigenic families, are involved in several important physiological and developmental processes. They use various peroxides as electron acceptors to catalyse a number of oxidative reactions and are present in almost all living organisms. We have created a peroxidase database (http://peroxibase.isb-sib.ch) that contains all identified peroxidase-encoding sequences (about 6000 sequences in 940 organisms). They are distributed between 11 superfamilies and about 60 subfamilies. All the sequences have been individually annotated and checked. PeroxiBase can be consulted using six major interlink sections ‘Classes', ‘Organisms', ‘Cellular localisations', ‘Inducers', ‘Repressors' and ‘Tissue types'. General documentation on peroxidases and PeroxiBase is accessible in the ‘Documents' section containing ‘Introduction', ‘Class description', ‘Publications' and ‘Links'. In addition to the database, we have developed a tool to classify peroxidases based on the PROSITE profile methodology. To improve their specificity and to prevent overlaps between closely related subfamilies the profiles were built using a new strategy based on the silencing of residues. This new profile construction method and its discriminatory capacity have been tested and validated using the different peroxidase families and subfamilies present in the database. The peroxidase classification tool called PeroxiScan is accessible at the following address: http://peroxibase.isb-sib.ch/peroxiscan.ph
    corecore